rank | frequency | n-gram |
---|---|---|
1 | 9392 | -ς |
2 | 4723 | -ν |
3 | 3893 | -α |
4 | 3229 | -ι |
5 | 2859 | -ε |
rank | frequency | n-gram |
---|---|---|
1 | 1421 | -ου |
2 | 1406 | -ας |
3 | 1319 | -ες |
4 | 1289 | -αι |
5 | 1262 | -ων |
rank | frequency | n-gram |
---|---|---|
1 | 1197 | -ται |
2 | 778 | -ουν |
3 | 672 | -ική |
4 | 560 | -σης |
5 | 548 | -ικό |
rank | frequency | n-gram |
---|---|---|
1 | 524 | -νται |
2 | 515 | -ουμε |
3 | 490 | -εται |
4 | 401 | -ικής |
5 | 397 | -ικές |
rank | frequency | n-gram |
---|---|---|
1 | 378 | -ονται |
2 | 211 | -οντας |
3 | 201 | -τικές |
4 | 196 | -τικών |
5 | 194 | -τικής |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings